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We present a measurement of the WW + W Z production cross section observed in a final state 
consisting of an identified electron or muon, two jets, and missing transverse energy. The measure- 
ment is carried out in a data sample corresponding to up to 4.6 fb _1 of integrated luminosity at 
\/s = 1.96 TeV collected by the CDF II detector. Matrix element calculations are used to separate 
the diboson signal from the large backgrounds. The WW + W Z cross section is measured to be 
17.4 ± 3.3 pb, in agreement with standard model predictions. A fit to the dijet invariant mass 
spectrum yields a compatible cross section measurement. 
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Measurements of the production cross section of pairs 
of heavy gauge bosons test the electroweak sector of the 
standard model (SM). The production cross section can 
be enhanced by anomalous triple gauge boson interac- 
tions [l| or from new particles decaying to pairs of vector 
bosons. 
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FIG. 1: Leading-order diagrams for WW s-channel (a) and 
t-channel (b) and W Z s-channel (c) and t-channel (d) pro- 
duction. 



In this paper, we describe the measurement of the 
WW + WZ production cross section in events contain- 
ing a high-pT electron or muon and two hadronic jets. 
This event topology is expected when one W boson in 
the event decays to an electron or muon and a neu- 
trino, and the other W or Z boson decays to two quarks 
(WW/WZ -> Ivqq). We consider both the WW and 
WZ processes as signal because our limited detector res- 
olution of hadronic jets makes the separation of W — > qq' 
from Z — > qq impracticable. 

The leading-order WW and WZ production diagrams 
are shown in Fig. [TJ The predicted SM production cross 
sections at the Tevatron, calculated at next-to-leading 
order (NLO), are a(pp -)• WW) = 11.66 ± 0.70 pb and 
a(pp ->• WZ) = 3.46 ±0.30 pb 0. Both of these produc- 
tion cross sections have been measured previously at the 
Tevatron in channels in which both gauge bosons decay 
leptonically @, [|, and no deviation between measure- 
ment and prediction has been observed. 

Hadronic decay modes have higher branching ratios 
than the leptonic decays, but the corresponding final 
states are exposed to large backgrounds. The first ob- 
servation of diboson production at the Tevatron with a 
hadronic decay was achieved in events with two jets and 
large missing transverse energy at CDF [f| . Evidence and 
observation of the process and decay discussed in this pa- 
per, WW + WZ — > ivqq, were previously reported by the 
DO Q and CDF collaborations. The observation re- 
ported by CDF used a matrix element technique relying 
on knowledge of the differential cross sections of signal 
and background processes to separate signal events from 
the background. 

The measurement of WW + WZ — > ivqq is relevant 
to the search for the Higgs boson at the Tevatron. One 



of the most powerful channels used in the search for a 
Higgs boson with a mass lower than 130 GeV/c 2 is the 
channel in which the Higgs boson is produced in asso- 
ciation with a W boson, with the Higgs boson decaying 
to a pair of b quarks and the W boson decaying lepton- 
ically (WH — >• ivbb). A similar matrix clement anal- 
ysis to the one presented in this paper is employed in 
the WH -» ivbb search at CDF Q. A well-established 
measurement of the WW + WZ channel gives us confi- 
dence in the similar techniques in the search for the Higgs 
boson. Similar issues in background modeling and sys- 
tematic uncertainties are relevant for the two analyses. 
One important difference, however, is that the search for 
WH production uses methods to identify jets originat- 
ing from b-quarks (6-tagging), whereas the WW + WZ 
analysis presented in this paper does not use 6-tagging. 

This paper presents the details of the matrix element 
method used in the observation of WW + WZ — > ivqq, 
but applied to a larger data sample corresponding to up 
to 4.6 fb _1 of integrated luminosity taken with the CDF 
II detector and with some changes in the event selec- 
tion criteria. In particular, the event selection has been 
made more inclusive so that it resembles that used in the 
WH — > ivbb search more closely. 

The organization of the rest of this paper is as follows. 
Section [TT] describes the apparatus used to carry out the 
measurement, while Section UTT1 describes the event selec- 
tion and backgrounds. The modeling of the signal and 
background processes is discussed in Section IIV1 Sec- 
tion [V] contains the details of the matrix element tech- 
nique used for the measurement. The systematic uncer- 
tainties and results are discussed in Sections I VII and IVIII 
respectively. A fit to the dijet invariant mass spectrum, 
performed as a cross check, is presented in Section [Villi 
Finally, we summarize the conclusions in Section [IX! 



II. CDF II DETECTOR 

The CDF II detector is a nearly azimuthally and 
forward-backward symmetric detector designed to study 
pp collisions at the Tevatron. It is described in detail in 
Ref . @ . It consists of a charged particle tracking system 
surrounded by calorimeters and muon chambers. Parti- 
cle positions and angles are expressed in a cylindric co- 
ordinate system, with the z axis along the proton beam. 
The polar angle, 9, is measured with respect to the di- 
rection of the proton beam, and </> is the azimuthal angle 
about the beam axis. The pseudo-rapidity, rj, is defined 
as 7] = — ln(tan | ) . 

The momentum of charged particles is measured by 
the tracking system, consisting of silicon strip detectors 
surrounded by an open-cell drift chamber, all immersed 
in a 1.4 T solenoidal magnetic field coaxial with the Teva- 
tron beams. The silicon tracking system [l(j consists of 
eight layers of silicon covering the radial region from 1.5 
cm to 28 cm from the beam axis. The drift chamber, or 
central outer tracker (COT) [ll[, is composed of eight 
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superlaycrs that alternate between axial and 2 degree 
stereo orientations. Each superlayer contains 12 sense 
wires. The COT covers the radial region from 40 cm to 
137 cm and provides good tracking efficiency for charged 
particles out to \rj\ < 1.0. 

The tracking system is surrounded by calorimeters 
which measure the energies of electrons, photons, and 
jets of hadronic particles. The electromagnetic calorime- 
ters use a scintillating tile and lead sampling technology, 
while the hadronic calorimeters are composed of scintil- 
lating tiles with steel absorber. The calorimeters are di- 
vided into central and plug sections. The central region, 
composed of the central electromagnetic (CEM) [l2| and 
central and end-wall hadronic calorimeters (CHA and 
WHA) [13], covers the region |7y| < 1.1. The end- 
plug electromagnetic (PEM) [l4| and end-plug hadronic 
calorimeters (PHA) extend the coverage to |?/| < 3.6. The 
calorimeters have a component called the shower maxi- 
mum (ShowerMax) [l5[ detector located at the depth in 
the calorimeter at which the electromagnetic shower is 
expected to be widest. The ShowerMax uses wire cham- 
bers and cathode strips to provide a precise position mea- 
surement for electromagnetic clusters. 

A muon system composed of planar multi-wire drift 
chambers records hits when charged particles pass 
through. Four different sections of the muon detector 
are used for the analysis presented here: the central muon 
detector (CMU) [16|, the central muon upgrade (CMP), 
the central muon extension (CMX), and the barrel muon 
chambers (BMU). In the central region, \rj\ < 0.6, four 
layers of chambers located just outside of the calorimeter 
make up the CMU system; the CMU is surrounded by 
60 cm of iron shielding and another four layers of cham- 
bers compose the CMP system. The CMX covers the 
region with 0.6 < \f]\ < 1.0, while the BMU extends the 
coverage to 1.0 < \t]\ < 1.5. 

Cherenkov luminosity counters (CLCs) fl7| measure 
the rate of inelastic collisions, which can be converted to 
an instantaneous luminosity. The integrated luminosity 
is calculated from the instantaneous luminosity measure- 
ments. The CLCs consist of gaseous Cherenkov counters 
located at high pseudo-rapidity, 3.6< |ry| <4.6. 

The three-level trigger system at CDF is used to re- 
duce the event rate from 1.7 MHz to about 150 Hz. 
The first level uses hardware, while the second is a mix- 
ture of hardware and fast software algorithms [jjl] ■ The 
software-based third-level trigger makes use of detailed 
information on the event, very similar to that available 
offline. 



III. CANDIDATE EVENT SELECTION AND 
BACKGROUNDS 

The event selection can be divided into a baseline selec- 
tion corresponding to the topology of our signal, and a va- 
riety of vetoes that are imposed to remove backgrounds. 
The baseline selection, the relevant backgrounds, and the 




FIG. 2: Decay of WW/WZ events to a charged lepton, neu- 
trino, and two quarks. 

vetoes are all described in more detail below. 

A few relevant quantities for the event selection are 
defined here. The transverse momentum of a charged 
particle is px = psin#, where p is the momentum of 
the charged particle track. The analogous quantity mea- 
sured with calorimeter energies is the transverse energy, 
Et = Esm6. The missing transverse energy, ^r, is 
defined by Et = — X^i Etpfii , where hi is a unit vector 
perpendicular to the beam axis and pointing at the i th 
calorimeter tower. Mt is corrected for high-energy muons 
as well as for factors applied to correct hadronic jet ener- 
gies. We define $ T = \Bt\- Jets are clustered using a cone 
algorithm, with a fixed cone size in which the center of 
the jet is defined as {jf et , <fi> et ) and the size of the jet cone 

as AR = ^(rfower _ ^jety _ ^tower _ fijety < Q.4. 

A. Baseline event selection 

Figure [5] shows the WW/WZ decay topology that is 
considered as the signal in this analysis. The final state 
contains a charged lepton, a neutrino, and two quarks. 
We focus on events in which the charged lepton is an 
electron or muon. Events in which the W boson decays 
to a r lepton may also be considered part of the sig- 
nal if a leptonic r decay results in an isolated electron or 
muon. The neutrino passes through the detector without 
depositing energy; its presence can be inferred in events 
with The two quarks will hadronize to form colli- 
mated jets of hadrons. As a result, our baseline event 
selection requires events to contain one high-p^ electron 
or muon, significant j^y, and two jets. 

Several triggers are used to collect events for this anal- 
ysis. Roughly half of the events are selected with a 
trigger requiring a high-py central electron in the CEM 
(Et > 18 GeV, |?7| < 1.0). Two muon triggers, one re- 
quiring hits in both the CMP and CMU and the other 
requiring hits in the CMX, collect events with central 
muons (pt > 18 GeV/c, |?7| < 1.0). Finally, a trigger 
path requiring large $t and two jets is used to collect 
events with muons that were not detected by the cen- 
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tral muon triggers. The $t plus jets trigger requires 
$ T > 35 GeV and two jets with E T > 10 GeV. The jet 
Et and $t used in the trigger selection are not corrected 
for detector or physics effects. 

Further selection criteria are imposed on triggered 
events offline. Electron (muon) candidates are required 
to have E T > 20 GeV (p T > 20 GcV/c). They must 
fulfill several other identification criteria designed to se- 
lect pure samples of high-py electrons (muons) [l9[ , in- 
cluding an isolation requirement that the energy within 
a cone of AR < 0.4 around the lepton axis is less than 
10% of the Et (pt) of the electron (muon). The jet ener- 
gies are corrected for detector effects [20| ■ We require the 
highcst-£r jet in the event to have Et > 25 GeV and the 
second highcst-Er jet in the event to have Et > 20 GeV. 
Finally, we require $t > 20 GeV. 

Some criteria are imposed specifically on events col- 
lected by the $t plus jets trigger to ensure a high effi- 
ciency. We require that the two jets are sufficiently sepa- 
rated, AR > 1, that one of the jets is central, \r]j e t\ < 0.9, 
and that the transverse energy of both jets is larger than 
25 GeV. Even after these cuts, this trigger path is not 
fully efficient, which is taken into account by a correc- 
tion curve as a function of $t- 

B. Backgrounds 

The baseline selection is based on the signal topology 
we are trying to select. However, several backgrounds 
can result in events with a similar topology. 

• VF+jets: events in which a W boson is produced 
in association with quarks or gluons form a back- 
ground if the W boson decays leptonically. This is 
the dominant background because of its high pro- 
duction cross section and signal-like properties. 

• Z+jets: events in which a Z boson is produced in 
association with two quarks or gluons may enter our 
signal sample if the Z boson decays to electrons or 
muons and one lepton falls outside the fiducial re- 
gion of the detector or other mismeasurement leads 



to significant $t- 

• QCD non-VF: events in which several jets arc 
produced, but no real W boson is present, may 
form a background if a jet fakes an electron or 
muon and mismeasurement of the jet energies 
results in incorrectly assigning a large $t to the 
event. 



• ti: top quark pair production is a background be- 
cause top quarks nearly always decay to a W boson 
and a b quark. If a W boson decays leptonically, ti 
events may pass our baseline event selection crite- 
ria. 

• Single top: leading-order production and decay of 
single top quarks results in an event topology with 
a W boson and two quarks. 



C. Event vetoes 

In order to reduce the size of the backgrounds de- 
scribed above, several vetoes are imposed on events in our 
sample. Events are required to have no additional elec- 
trons, muons, or jets, reducing the Z+jets, QCD non-W, 
and ti backgrounds. A further Z+jets veto rejects events 
with a second loosely identified lepton with the opposite 
charge as the tight lepton if the invariant mass of the 
tight and loose lepton pair is close to the Z boson mass: 
76 < Mu < 106 GeV/c 2 . ZZ events are also effectively 
removed after this veto. 

A veto developed specifically to reduce the size of the 
QCD non-VF background is imposed. This veto is more 
stringent for events which contain an electron candidate, 
since jets fake electrons more often than muons. In elec- 
tron events, the minimum $t is raised to 25 GeV, and 
the transverse mass of the leptonically decaying W bo- 
son candidate, Mr(W) = ^2pt,^t(1 — cos(A0 £ ^J)j 

is required to be at least 20 GeV/c 2 . A variable called 
the $t significance is also defined: 



/£ ^JES COS 2 (A0. ct + COS 2 (A^ r " Jt ) ]T £T,uncl ' 

V jets 



where E^™ is the raw, uncorrected energy of a jet and 
Cjes is the correction to the jet energy (20|, -Er.unci is 
the vector sum of the transverse component of calorime- 
ter energy deposits not included in any jet, and ^ Et,uhc\ 
is the total magnitude of the unclustered calorimeter en- 



ergies. The $ T SIS is a measure of the distance between 
the $t and jets or unclustered energy; it tends to be 
larger for $t stemming from a neutrino than for $t 
stemming from mismeasurement. We require $t SI& > 
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0.05M T (W) + 3.5 and # T SIg > 2.5 - 3.125A</>^ in 
events with an electron candidate. In muon events, the 
QCD veto simply requires M T (W) > 10 GeV/c 2 . 

We veto events with additional "loose" jets, defined as 
jets with E T > 12 GcV and \r]\ < 2.0. This veto is found 
to improve the agreement between Monte Carlo and data 
in the modeling of some kinematic variables. 

Events consistent with photon conversion and cosmic 
ray muons are also vetoed (2lj . 

IV. MODELING 

Both the normalization (number of events in our sam- 
ple) and the shapes of signal and background processes 
must be understood to carry out this analysis. 

A. Models used 

The signal processes and all background processes ex- 
cept the QCD non-FT background arc modeled using 
events generated by a Monte Carlo program which are 
run through the CDF II detector simulation [22| . The 
Monte Carlo event generators used for each process are 
listed in Table HI pythia is a leading-order event gen- 
erator that uses a parton shower to account for initial 
and final state radiation [23j | . ALPGEN and madevent 
are leading-order parton- level event generators [24|, [2{| ; 
events generated by ALPGEN and madevent are passed 
to pythia where the parton shower is simulated. 

The top mass is assumed to be 175 GeV/c 2 in the mod- 
eling of tt and single top events. The distributions of the 
longitudinal momenta of the different types of quarks and 
gluons within the proton as a function of the momentum 
transfer of the collision are given by parton distribution 
functions (PDFs). The CTEQ5L PDFs are used in gen- 
erating all Monte Carlo samples in this analysis [26j . 

TABLE I: Monte Carlo programs used to generate events for 
signal and background processes. 



Sample Generator 

WW PYTHIA 

WZ PYTHIA 

VK+jetS ALPGEN + PYTHIA 

Z+jets ALPGEN + PYTHIA 

tt PYTHIA 

Single top madevent + pythia 



Simulation of the QCD non-VF background is difficult: 
its production cross section is large and the probabil- 
ity to mimic a W boson in the event is small. In addi- 
tion, the mismeasurements that lead to the QCD non-14^ 
background having large $t may not be simulated well. 
Therefore, this background is modeled using data rather 
than simulation. Events from jet-based triggers contain- 
ing a jet that deposits most of its energy in the electro- 



magnetic segment of the calorimeter, as well as events 
from single lepton triggers that fail lepton requirements 
but pass a looser set of requirements are used. 



B. Expected event yields 

The number of events due to the signal and Z+jets, tt 
and single top backgrounds that enter our sample are es- 
timated based on their cross section (a), the efficiency (e) 
with which they are selected, and the integrated luminos- 
ity (£): N — eCa. The efficiency e, which includes the 
detector acceptance, is estimated from the Monte Carlo 
simulation, a is taken from NLO calculations for the 
WW, WZ, and single top processes and from the CDF 
inclusive Z boson production cross section measurement 
for the Z+jets background [27j. 

As mentioned in the introduction, the WW and WZ 
cross sections calculated at NLO are 11.66 ±0.70 pb and 
3.46 ± 0.30 pb respectively Q. The acceptance of these 
samples measured with respect to the inclusive produc- 
tion cross section is about 2.4% for WW events and about 
1.2% for WZ events. 

Since neither the production cross section nor the selec- 
tion efficiency of the QCD non-W background is known, 
we rely on a data-driven technique to estimate its normal- 
ization. The shape of the $t spectrum is very different 
in events with a real W boson than in the events com- 
ing from the QCD non-11^ background, as is shown in 
Fig. [3J The $t spectrum observed in data is fit with the 
sum of all contributing processes, where the QCD non-iy 
normalization and the VF+jets normalization are free pa- 
rameters. The fit is performed over < $t < 120 GeV, 
meaning the cut on the $t described in the event selec- 
tion above is removed. An example of the fit is shown in 
Fig. [3] for events with a central electron. The percentage 
of QCD nonW-^ events in our signal sample (with the Bt 
cut imposed) is estimated based on the fit; it is about 5% 
for events with a central electron, 3% for events with a 
central muon, and 3% for events in the extended muon 
category. 

The IV+jets normalization is a free parameter in the 
final likelihood fit to extract the WW+WZ cross section, 
which is described in Section VC. A preliminary estimate 
of the W^+jets normalization used in the modeling vali- 
dation is derived from the $t fit described above. Ta- 
ble |ll] lists the total expected number of events for signal 
and background processes. The background normaliza- 
tion uncertainties will be described in Sec. I VII 



C. Background shape validation 

The kinematics of the background model are validated 
by comparing the shape of various kinematic quantities 
in data to the prediction from the models. Each signal 
and background process is normalized according to Ta- 
ble HO and the sum of their shapes for a given quantity 
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FIG. 3: Fit to to determine the contribution from the 
QCD non-W 7 background for events containing a central elec- 
tron. 



TABLE II: Expected number of events for each signal and 
background process. 



Process 



Predicted number of events 



WW signal 

WZ signal 

W+jets 

Non-W 

Z+jets 

tt 

Single top 



1262 
191 
35717 
1515 
1680 
285 
267 



110 
21 

7143 

606 

220 

38 

40 



is compared to that observed in the data. Some exam- 
ples of the comparisons are shown in Fig. |4] for the ^t, 
the lepton Ex, the E? and rj of both jets, the distance 
between the two jets (ARjj), and the px of the two- 
jet system (pTjj). In all of these figures, the integral of 
the total expectation is set to be equal to the number of 
data events, so the figures show shape comparisons. The 
hatched band is the uncertainty in the shape of the back- 
grounds due to the jet energy scale and the Q 2 scale in 
ALPGEN, described further in Sec. IVI1 The modeling of 
the kinematic quantities generally matches the data well 
within the uncertainties. In the case oipTjj, the system- 
atic uncertainties do not seem to cover the disagreement 
between data and Monte Carlo, so an additional mismod- 
eling uncertainty is imposed; this is described further in 
Sec. lVIl The mismodeling uncertainty derived from pxjj 
also affects the modeling of correlated variables, particu- 
larly ARjj and Mjj , covering the observed disagreement 



between data and expectation. 



V. MEASUREMENT TECHNIQUE 

The expected number of events from WW + WZ pro- 
duction is small compared to the expected number of 
events from VF+jets production. Moreover, the uncer- 
tainty on the number of W^+jets events expected is large 
due to uncertainty in the modeling of this process, mak- 



ing it difficult to separate the WW + WZ signal from 
the VF+jets background. We employ a matrix element 
analysis technique to improve the signal and background 
separation. Matrix element probabilities for various pro- 
cesses are calculated which are then combined to form a 
single discriminant. 



A. Matrix element event probability 

The matrix element method defines a likelihood for an 
event to be due to a given production process based on 
the differential cross section of the process. An outline of 
the procedure is given here, and full details can be found 
in Ref. 

The differential cross section for an n-body final state 
with two initial state particles with momenta q{ and (ft 
and masses mi and m,2 is 



da = 



(2tt) 4 |A^ 



X (f$„ 



4\/ (ql • <ft) 2 ~ m\m. 
where d<f> n is a phase space factor given by 



5%i 



<12 



i=l 



d 3 Pt 



LA (2^)32^ 



(2) 



(3) 



and Ei and pi are the energies and momenta of the fi- 
nal state particles [2!|. M. is the matrix element of the 
process. 

We define a probability density for a given process by 
normalizing the differential cross section to the total cross 
section: 



P 



da 
a 



(4) 



P is not a true probability, as various approximations are 
used in the calculation of the differential cross section: 
leading-order matrix elements are used, there are inte- 
grations over unmeasured quantities (described below), 
and several constants are omitted from the calculation. 

We cannot measure the initial state momenta and the 
resolution of the final state measurements is limited by 
detector effects. As a result, we weight the differential 
cross section with parton distribution functions (PDFs) 
for the proton and integrate over a transfer function en- 
coding the relationship between the measured quantities 
x and the parton-levcl quantities y. The probability den- 
sity is then given by 



P(x) = ^ J da(y)dq 1 dq 2 f(q 1 )f(q 2 )W(y,x), 



(5) 



where f(qi) and /((ft) are the PDFs in terms of the frac- 
tion of the proton momentum (<& = E q i/ E\, eam ), and 
W(y,x) is the transfer function. The PDFs are evalu- 
ated based on the CTEQ6.1 parameterization (2(|. Using 
Eqs. [5] and [3] and neglecting the masses and transverse 
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FIG. 4: Comparison of shapes between data and models for various kinematic quantities. The shaded region includes the effect 
of the major systematic uncertainties: the jet energy scale, JES, and the renormalization scale, Q 2 . 
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momenta of the initial partons, the event probability is 
given by 



P(x) 



1 



2ir 4 \M\ 



\E ql \ \E, 



92 1 



W(y,x)d^ 4 dE ql dE q2 . 



(6) 

The squared matrix element, LM| 2 , is calculated at tree 
level using the HELAS package [301 ! w ith the diagrams for 
a given process provided by MADGRAPH [25j |. 

In W(y,x), the lepton energy and angle, as well as 
the jet angles, are assumed to be measured exactly. The 
jet energy transfer function is derived by comparing par- 
ton energies to the fully simulated jet response in Monte 
Carlo events. A double Gaussian parameterization of 
the difference between the jet and parton energy is used. 
Three different transfer functions are derived: one for jets 
originating from b quarks, one for jets originating from 
other non-& quarks, and one for jets originating from glu- 
ons. The appropriate transfer function is chosen based 
on the diagram in the matrix element being evaluated. 
The measured missing transverse energy is not used in 
the calculation of the event probability; conservation of 
momentum is used to determine the momentum of the 
neutrino. 

After conservation of energy and momentum have been 
imposed, the integral to derive the event probability is 
three-dimensional: the energies of the quarks and the 
longitudinal momentum of the neutrino arc integrated 
over. The integration is carried out numericall y u sing an 
adaptation of the CERNLIB radmul routine [3lJ or the 
faster DIVONNE integration algorithm implemented in the 
CUBA library [32j . The results of the two integrators were 
checked against each other and found to be compatible. 



B. Event Probability Discriminant 

The matrix element event probability is calculated for 
the signal WW and WZ processes, as well as for single 
top production and several contributions to the VF+jets 
background: Wgg, Wgq, Wbb, Wcc, and Wcg, where g, 
q, b, and c are gluons, light flavor quarks, bottom quarks, 
and charm quarks respectively. 

No matrix clement calculation is carried out for the ti, 
Z+jets, and QCD non-W background processes. All of 
these backgrounds require some additional assumptions, 
making the matrix element calculation more difficult and 
computationally intensive. For example, tt events be- 
come a background if several jets or a lepton arc not 
detected; incorporating this in the matrix element cal- 
culation requires additional integrations which are com- 
putationally cumbersome. For the Z+jets background 
process, a lepton either fakes a jet or escapes detection, 
two scenarios difficult to describe in the matrix element 
calculation. Finally, the QCD non-M^ background would 
require a large number of leading-order diagrams as well 
as a description of quarks or gluons faking leptons. The 
■2f+jets and QCD backgrounds look very different from 



the signal (i.e. there will be no resonance in the dijet 
mass spectrum) so we expect good discrimination even 
without including probabilities explicitly for those back- 
ground processes. 

The probabilities for individual processes described 
above (Pi, where i runs over the processes) are com- 
bined to form a discriminant, a quantity with a dif- 
ferent shape for background-like events than for signal- 
like events. We define the discriminant to be of the 
form P s ig na i/ (background + Signal) so that background- 
like events will have values close to zero and signal-like 
events will have values close to unity. The P s i gna i and 
^background are just the sum of individual probabilities 
for signal and background processes, but we put in some 
additional factors to form the event probability discrim- 
inant, or EPD. 

First, as noted above, various constants are omitted 
from the calculations of Pi . Wc normalize the Pi relative 
to each other by calculating them for each event in large 
Monte Carlo samples. We then find the maximal Pi over 
all Monte Carlo events corresponding to a given process, 
pmax rpj^g normalized probabilities are then given by 

iy p i max - 

In addition, we multiply each P, by a coefficient, Cj. 
This coefficient has the effect of weighting some probabil- 
ities more than others in the discriminant. These Ci are 
optimized to achieve the best expected sensitivity based 
on the models. The full EPD is then given by: 



E 



CiPi 

pmax 



EPD = 



■-Big n p «BG r p 

E % 1 1 \ W i 
pi 

i=i 



(7) 



E 



max ' / j pmax 

j'=i i 



where the summation over signal processes runs over 
WW and WZ (n s i g = 2) and the summation over back- 
ground processes runs over Wgg, Wgj, Wbb, Wcg, and 
the single top diagrams (tt-bg = 6) . 

Figure[S]shows the EPD templates for signal and back- 
ground processes normalized to unit area. The back- 
ground processes all have similar shapes while the sig- 
nal process falls more slowly. Wc validate the modeling 
of the EPD for background events by comparing data 
and simulation in the region with Mjj < 55 GcV/c 2 and 
Mjj > 120 GeV/c 2 , where we expect very little signal. 
The result of the comparison is shown in Fig. |S1 The 
agreement between data and simulation is very good. 

The effectiveness of the EPD in isolating signal-like 
events can be seen by plotting the invariant mass of the 
two jets in EPD bins, shown in Fig. [7j This quantity is 
expected to have a resonance around the W or Z boson 
mass for signal-like events. The bin with low EPD values 
(0< EPD <0.25), in the top left plot, has events in the 
full dijet mass range from 20 to 200 GeV/c 2 . For EPD > 
0.25, however, the Mjj distribution is peaked around the 
W/Z mass. As the EPD range approaches unity, the 
expected signal to background ratio increases and the 
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Event Probability Discriminant 



FIG. 5: Shape of the EPD for signal and background pro- 
cesses normalized to unit area. 




Event Probability Discriminant 



FIG. 6: Comparison of the EPD in data and simulation for 
events with Mj, < 55 GeV/c 2 or Mjj > 120 GeV/c 2 . 

dijet mass peak becomes narrower. 

C. Likelihood fit 

The shape of the EPD observed in data is fit to a sum 
of the templates shown in Fig. [5] to extract the signal 
cross section. The events are divided into three channels 
corresponding to different lepton categories: one channel 
for central electrons, another for central muons, and a 
third for events with muons collected by the $t plus jets 
trigger. 

A maximum likelihood fitting procedure is used. The 
likelihood is denned as the product of Poisson probabili- 
ties over all bins of the EPD template over all channels: 

L=U^, (8) 

2—1 1 

where rii and \ii are the observed and predicted number 
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U, (GeV/c 2 ) M, (GeV/c 2 ) 

FIG. 7: Distribution of the dijet mass in four EPD bins. 

of events in bin i respectively. The prediction in a bin is 
the sum over signal and background predictions: 

n sig n bg 

Mi = 2J Sik + X] bik ( 9 ) 

fe=l fe=l 

with bik the predicted contribution from background k 
in bin i. n s i g is two, corresponding to the WW and WZ 
processes; nb g is the number of background processes. 

The predicted number of events in a bin is affected 
by systematic uncertainties. The sources of systematic 
uncertainty are described in detail in Section I VI I For 
each source of uncertainty, a nuisance parameter is intro- 
duced whose value changes the predicted contribution of 
a process to a bin. Each nuisance parameter has a Gaus- 
sian probability density function (p.d.f.) with a mean 
of zero and a width given by the la uncertainty. A de- 
tailed mathematical description of the way the nuisance 
parameters are incorporated in the likelihood is given in 
Ref. [2l|. 

Finally, with a likelihood that is a function of the ob- 
served data, the signal cross section, the predicted signal 
and background contributions, and systematic uncertain- 
ties and their corresponding nuisance parameters, we ex- 
tract the cross section. A Bayesian marginalization tech- 
nique integrates over the nuisance parameters, resulting 
in a posterior probability density which is a function of 
the signal cross section. The measured cross section cor- 
responds to the maximum point of the posterior prob- 
ability density, and the 68% confidence interval is the 
shortest interval containing 68% of the area of the pos- 
terior probability density. 

The measured cross section is the total cross section 
of the signal, aww+wz ■ Assuming the ratio between 
the WW and WZ cross sections follows the NLO pre- 
diction, crww+wz = oww + &wz- If the ratio between 
the cross sections is different than the NLO prediction, 
we are measuring the total cross section aww+wz = 
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l-13a' ww + 0.56a' wz . Here cr' ww and a' wz arc not as- 
sumed to follow NLO predictions. The ratio between the 
WW and WZ acceptances is predicted from the signal 
simulations described in Sec. IVA. 



VI. SYSTEMATIC UNCERTAINTIES 

Systematic uncertainties affect the normalization of 
background processes, the signal acceptance, and the 
shape of the EPD for both background and signal pro- 
cesses. The sources of systematic uncertainty and the 
aspects of the measurement affected by each are briefly 
described in this section. Finally, the expected contribu- 
tion of the uncertainties to the WW + WZ cross section 
measurement are explored. 



A. Sources of uncertainty 

Normalization of background processes: The 

uncertainties in the normalization of the back- 
ground processes are summarized in Table UTTl The 
uncertainty on the W+jets normalization is taken 
to be an arbitrarily large number; the fit to extract 
the cross section constrains the VF+jets normaliza- 
tion to a few percent, so taking a 20% uncertainty 
is equivalent to allowing the H^+jets normalization 
to float. The uncertainty on the 2+jets, ti, and 
single top backgrounds are derived from the un- 
certainty in their cross sections and uncertainties 
on the efficiency estimate. The 40% uncertainty 
on the QCD non-VF contribution is a conservative 
estimate based on differences observed between dif- 
ferent choice of sample models. 



TABLE III: Uncertainties in the background normalizations. 



Background 


Normalization uncertainty 


W+jets 


20% 


Z+jcts 


15% 


QCD non-W 


40% 


ti and single top 


12% 



• Jet Energy Scale (JES): As mentioned above, 
jet energies are corrected for detector effects. The 
corrections have systematic uncertainties associ- 
ated with them [2(J. The size of the la uncertainty 
depends on the Et of the jet, ranging from about 
3% for jet E T ~ 80 GeV to about 7% for jet E T -20 
GcV. 

The effect of the JES uncertainty on the measure- 
ment is estimated by creating two shifted Monte 
Carlo samples: one in which the energy of each jet 
in each event of our Monte Carlo samples is shifted 
by +1(7 and the second in which each jet energy is 
shifted by — la, taking the ^-dependence of the 



uncertainty into account. The whole analysis is re- 
peated with the shifted Monte Carlo samples, in- 
cluding the calculation of the matrix elements. 

The JES uncertainty has a small effect on the es- 
timated signal acceptance because the efficiency of 
the jet Et selection depends on the JES. The size 
of the acceptance uncertainty is about 1%. In ad- 
dition, the shape of the EPD templates for the sig- 
nal processes and for the dominant VF+jets back- 
ground process are affected by the JES uncertainty. 
The change in the background shape is relatively 
small compared to the change in the signal shape. 
The signal normalization uncertainty, the signal 
shape uncertainty, and the background shape un- 
certainty are incorporated as a correlated uncer- 
tainty in the likelihood fit. 

• Q 2 scale in ALPGEN: The factorization and renor- 
malization scale, or Q 2 scale, is a parameter in the 
perturbative expansion used to calculate matrix el- 
ements in ALPGEN. Higher-order calculations be- 
come less dependent on the choice of scale, but 
ALPGEN is a leading-order generator and its mod- 
eling is affected by the choice of scale. The scale 
used in generating our central M^+jets samples is 
Q 2 = m 2 v + Srriy, where mw is the mass of the W 
boson, m,T is the transverse mass, and the summa- 
tion is over all final-state partons. ALPGEN VF+jets 
samples were generated with this central scale dou- 
bled and divided by two. These are taken as ±la 
uncertainties on the shape of the VF+jets EPD 
template. 

• Integrated luminosity: The integrated luminos- 
ity is calculated based on the pp inelastic cross sec- 
tion and the acceptance of CDF's luminosity mon- 
itor [33j]. There is a 6% uncertainty on the cal- 
culation, which is included correlated uncer- 
tainty on the normalization of all processes except 
the non-VF QCD background and the VF+jets back- 
ground, whose normalizations are determined from 
fits to the data. 

• Initial and final state radiation: Comparison 
between samples simulated with pythia and Drcll- 
Yan data, where no FSR is expected, are used to 
determine reasonable uncertainties for the param- 
eters used to tune the initial and final state radi- 

The signal WW and WZ 



34 



ation in pythia 
samples were generated with the level of ISR and 
FSR increased and decreased, and the change in 
the acceptance was estimated. This results in an 
uncertainty of about 5% on the signal acceptance. 

PDFs: The PDFs used in generating the Monte 
Carlo samples have some uncertainty associated 
with them. The uncertainty on the signal accep- 
tance is estimated in the same way as in Ref. [34J. 
The uncertainty in the signal acceptance is found 
to be 2.5%. 
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• Jet Energy Resolution (JER): A comparison 
between data and simulation is used to assign an 
uncertainty on the jet energy resolution (35j. For 
a jet with measured Et of 40 GeV, the jet energy 
resolution is (13 ± 7)%. The matrix element cal- 
culations are repeated for the signal Monte Carlo 
sample with a higher jet energy resolution, and no 
change in the shape of the EPD is observed. A 
small (~ 1%) uncertainty on the signal acceptance 
is assigned. 

• W+jets modeling: In addition to the shape un- 
certainties on the W+jets EPD due to the JES 
and Q 2 scale, we impose shape uncertainties due 
to mismodeling of the px of the dijet system (pTjj) 
and the 77 of the lower-p^ jet in the event (^2)- 
We derive the uncertainty due to the mismodel- 
ing of these variables by reweighting the W+jets 
Monte Carlo model to agree with data as a function 
of either PTjj or 77^2 - When deriving the weights, 
we remove events with 55 < Mjj < 120 GeV/c 2 
(the region in which we expect most of the signal) 
to avoid biasing the measurement towards the ex- 
pected result. The mismodeling of 77^2 has a negli- 
gible effect on the shape of the EPD, whereas the 
mismodeling of PTjj has a small effect on its shape. 

• Lepton identification efficiency: There is a 2% 
uncertainty on the efficiency with which we can 
identify and trigger on leptons. This uncertainty 
is assigned in the same way as the uncertainty on 
the integrated luminosity. 



B. Effect on cross section fit 

Pseudoexperimcnts are carried out to determine the 
expected uncertainty on the WW + WZ cross section. 
The pseudoexperiments are generated by varying the bin 
contents of each template histogram according to Poisson 
distributions as well as randomly setting a value for each 
nuisance parameter according to its p.d.f. The likelihood 
fit is applied to each pscudocxpcrimcnt to extract the 
WW + WZ cross section. 

In order to estimate the effect of certain systematic un- 
certainties, they are taken out of the pseudo-experiments 
one-by-one. The expected statistical uncertainty (includ- 
ing the uncertainty on the background normalizations) 
was found to be 14% while the total systematic uncer- 
tainty is expected to be 16%. The total (systematic plus 
statistical) uncertainty expected on the WW + WZ cross 
section is 21%. The largest predicted systematic uncer- 
tainties are the JES, Q 2 scale, and luminosity uncertain- 
ties, which contribute 8%, 7%, and 6% respectively to 
the total (Tww+wz uncertainty. 

Based on the pseudoexperimcnts, we can also under- 
stand which nuisance parameters are constrained in the 
likelihood fit. The W+jets normalization uncertainty, 



which has a width of 20% in the prior p.d.f., is con- 
strained on average to 1.8% in the pseudoexperiments. 
The first few bins of the EPD, which are dominated by 
the W+jets contribution, establish this constraint, and 
the effect of the constraint is to reduce the uncertainty in 
the W+jets normalization in the high- EPD bins, which 
are most important to the signal extraction. 

VII. RESULTS 

The likelihood fit is carried out in a data sample cor- 
responding to an integrated luminosity of 4.6 fb _1 . The 
shape of the EPD observed in data is shown superim- 
posed on the shape expected from Monte Carlo in Fig. [3] 
The cross section for WW + WZ production is found 
to be a(pp WW + WZ) = 17 A ± 3.3 pb. This re- 
sult agrees with the prediction from NLO calculations of 
15.1 + 0.9 pb. 




Event Probability Discriminant 

FIG. 8: Stacked EPD templates with data superimposed. 

The cross section was extracted in each lepton channel 
separately as a cross-check. The results are listed in Ta- 
ble |IV] The extracted cross section agrees across lepton 
channels. 



TABLE IV: Fitted WW+WZ cross section in the three lepton 
categories and in the whole sample. 



Category 


Cross section [pb] 


Central electrons 




Central muons 


19.8+54 


Extended muons 


10.7±£° 


All 


17.4 + 3.3 



VIII. FIT TO THE DIJET INVARIANT MASS 

A similar template fit to the one described above was 
carried out using the invariant mass of the two jets rather 
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than the EPD with exactly the same event selection and 
sources of systematic uncertainty. The distribution of 
Mjj in data is shown superimposed on the stacked pre- 
dictions in Fig. IH1 The templates for the fit are shown in 
Fig. \W\ There is a resonance for the WW + WZ signal 
since the two jets are a product of IU or Z boson decay, 
while the backgrounds have very different shapes with- 
out an apparent resonance. The shape of the W/Z+']ets 
background is a falling distribution shaped by event se- 
lection cuts. 



-•- Data, L = 4.6 fb' 1 

■ ww+wz 

I I W/Z+jets 
^| QCD non-W 

tt + single top 
Syst. uncertainty 




150 

[GeV/c 2 ] 

FIG. 9: Distribution of Mjj in data superimposed on Monte 
Carlo prediction. The shaded regions include JES, Q 2 , and 
VF+jets modeling systematic uncertainties. 
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o 0.05 - 




FIG. 10: Shape of Mjj templates for signal and background 
processes normalized to unit area. 

The expected uncertainty on the WW + WZ cross 
section extracted by a fit to Mjj is about 19%, lower 
than the expected uncertainty when fitting the EPD. 
While the statistical uncertainty is larger when fitting 
Mjj than when fitting the EPD, the systematic uncer- 
tainty is smaller. The dominant systematic uncertainty 
is expected to be the shape uncertainty on the VF+jets 
background due to the mismodeling of PTjj, while the 



JES and Q 2 scale uncertainties are less important than 
when fitting the EPD. 

The WW + WZ cross section extracted from the fit 
to Mjj is 12.4^3g pb. Based on pseudo-experiments, the 
expected correlation between the fit to Mjj and the fit 
to EPD is about 60%. Thus the cross sections extracted 
from the EPD and the Mjj fits have a discrepancy of 
about 1.8(7. 

Fitting the dijet mass is presented here as a cross-check 
to the result from the matrix element technique because 
it is a less sensitive way of extracting the signal. In other 
words, the expected probability that the signal can be 
faked by the background is higher when fitting the dijet 
mass than when fitting the EPD. As a result, the first 
observation of the WW + WZ signal in this channel was 
provided by the matrix element technique Q. With the 
data sample presented in this paper, the expected sen- 
sitivity of the matrix element technique is 5. Oct, while it 
is 4.6ct when fitting Mjj. The observed significances are 
5.4ct and 3.5ct for the matrix element and Mjj analyses 
respectively. 



IX. CONCLUSIONS 

We have extracted the cross section for WW + WZ 
production in the final state with a lepton, two jets, and 
missing transverse energy using a matrix element tech- 
nique. The cross section is measured to be 17.4 ±3.3 pb, 
in agreement with the NLO theoretical prediction of 
15.1 ± 0.9 pb. The measurement is primarily system- 
atically limited; the jet energy scale and Q 2 scale uncer- 
tainties give both large contributions to the total uncer- 
tainty. Improvements to the cross section measurement 
could be achieved by reducing the size of the system- 
atic uncertainties via data-driven methods. The effect of 
systematic uncertainties on the measurement could also 
be reduced by further optimization of the event selection 
and discriminant. 
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